Action recognition by learning pose representations

نویسندگان

  • Alessia Saggese
  • Nicola Strisciuglio
  • Mario Vento
  • Nicolai Petkov
چکیده

Pose detection is one of the fundamental steps for the recognition of human actions. In this paper we propose a novel trainable detector for recognizing human poses based on the analysis of the skeleton. The main idea is that a skeleton pose can be described by the spatial arrangements of its joints. Starting from this consideration, we propose a trainable pose detector, that can be configured on a prototype skeleton in an automatic configuration process. The result of the configuration is a model of the position of the joints in the concerned skeleton. In the application phase, the joint positions contained in the model are compared with the ones of their homologous joints in the skeleton under test. The similarity of two skeletons is computed as a combination of the position scores achieved by homologous joints. In this paper we describe an action classification method based on the use of the proposed trainable detectors to extract features from the skeletons. We performed experiments on the publicly available MSDRA data set and the achieved results confirm the effectiveness of the proposed approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sparse Dictionary Learning and Domain Adaptation for Face and Action Recognition

Title of dissertation: SPARSE DICTIONARY LEARNING AND DOMAIN ADAPTATION FOR FACE AND ACTION RECOGNITION Qiang Qiu, Doctor of Philosophy, 2013 Dissertation directed by: Professor Rama Chellappa Department of Computer Science New approaches for dictionary learning and domain adaptation are proposed for face and action recognition. We first present an approach for dictionary learning of action att...

متن کامل

Does Human Action Recognition Benefit from Pose Estimation?

Introduction The earliest works in action recognition focused on tracking body parts and classifying the joint movements. These pose-based approaches, while straight-forward, require accurate tracking of body parts, which is a challenging task in its own right. As recent trends in action recognition have shifted towards natural and unconstrained videos (e.g. films, broadcast sports, Youtube vid...

متن کامل

Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture

Intelligent agents, such as robots, have to serve a multitude of autonomous functions. Examples are, e.g., collision avoidance, navigation and route planning, active sensing of its environment, or the interaction and non-verbal communication with people in the extended reach space. Here, we focus on the recognition of the action of a human agent based on a biologically inspired visual architect...

متن کامل

Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects

When we see a human sitting down, standing up, or walking, we can recognize one of these poses independently of the individual, or we can recognize the individual person, independently of the pose. The same issues arise for deforming objects. For example, if we see a flag deformed by the wind, either blowing out or hanging languidly, we can usually recognize the flag, independently of its defor...

متن کامل

3D Probabilistic Representations for Vision and Action

Autonomous robots must be able to construct their own representations that enable them to interact successfully with their environment. In less-than-tightly controlled environments, adequate management of (perceptual and action-related) uncertainty is crucial. We present a framework for 3D visual representations that can be learned from visual training data without requiring external supervisio...

متن کامل

Latent Pose Estimator for Continuous Action Recognition

Recently, models based on conditional random fields (CRF) have produced promising results on labeling sequential data in several scientific fields. However, in the vision task of continuous action recognition, the observations of visual features have dimensions as high as hundreds or even thousands. This might pose severe difficulties on parameter estimation and even degrade the performance. To...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1708.00672  شماره 

صفحات  -

تاریخ انتشار 2017